Search | BVS Regional Portal

Proteomics Standards Initiative at Twenty Years: Current Activities and Future Work.

Deutsch, Eric W; Vizcaíno, Juan Antonio; Jones, Andrew R; Binz, Pierre-Alain; Lam, Henry; Klein, Joshua; Bittremieux, Wout; Perez-Riverol, Yasset; Tabb, David L; Walzer, Mathias; Ricard-Blum, Sylvie; Hermjakob, Henning; Neumann, Steffen; Mak, Tytus D; Kawano, Shin; Mendoza, Luis; Van Den Bossche, Tim; Gabriels, Ralf; Bandeira, Nuno; Carver, Jeremy; Pullman, Benjamin; Sun, Zhi; Hoffmann, Nils; Shofstahl, Jim; Zhu, Yunping; Licata, Luana; Quaglia, Federica; Tosatto, Silvio C E; Orchard, Sandra E.

J Proteome Res ; 22(2): 287-301, 2023 02 03.

Article in English | MEDLINE | ID: mdl-36626722

ABSTRACT

The Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has been successfully developing guidelines, data formats, and controlled vocabularies (CVs) for the proteomics community and other fields supported by mass spectrometry since its inception 20 years ago. Here we describe the general operation of the PSI, including its leadership, working groups, yearly workshops, and the document process by which proposals are thoroughly and publicly reviewed in order to be ratified as PSI standards. We briefly describe the current state of the many existing PSI standards, some of which remain the same as when originally developed, some of which have undergone subsequent revisions, and some of which have become obsolete. Then the set of proposals currently being developed are described, with an open call to the community for participation in the forging of the next generation of standards. Finally, we describe some synergies and collaborations with other organizations and look to the future in how the PSI will continue to promote the open sharing of data and thus accelerate the progress of the field of proteomics.

Subjects

Proteome; Proteomics; Humans; Reference Standards; Controlled Vocabulary; Mass Spectrometry; Protein Databases

Universal Spectrum Identifier for mass spectra.

Deutsch, Eric W; Perez-Riverol, Yasset; Carver, Jeremy; Kawano, Shin; Mendoza, Luis; Van Den Bossche, Tim; Gabriels, Ralf; Binz, Pierre-Alain; Pullman, Benjamin; Sun, Zhi; Shofstahl, Jim; Bittremieux, Wout; Mak, Tytus D; Klein, Joshua; Zhu, Yunping; Lam, Henry; Vizcaíno, Juan Antonio; Bandeira, Nuno.

Nat Methods ; 18(7): 768-770, 2021 07.

Article in English | MEDLINE | ID: mdl-34183830

ABSTRACT

Mass spectra provide the ultimate evidence to support the findings of mass spectrometry proteomics studies in publications, and it is therefore crucial to be able to trace the conclusions back to the spectra. The Universal Spectrum Identifier (USI) provides a standardized mechanism for encoding a virtual path to any mass spectrum contained in datasets deposited to public proteomics repositories. USI enables greater transparency of spectral evidence, with more than 1 billion USI identifications from over 3 billion spectra already available through ProteomeXchange repositories.
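
As a rough illustration of the "virtual path" idea, the sketch below splits a USI string into its parts. It assumes the commonly documented layout `mzspec:<collection>:<run>:<index type>:<index>[:<interpretation>]`, which the abstract itself does not spell out; the function and field names are hypothetical, and run names that contain colons are not handled.

```python
from typing import NamedTuple, Optional


class USI(NamedTuple):
    """Parts of a Universal Spectrum Identifier (field names are illustrative)."""
    collection: str                 # dataset identifier, e.g. a ProteomeXchange accession
    ms_run: str                     # name of the MS run (file) inside the dataset
    index_type: str                 # how the spectrum is addressed within the run
    index: str                      # the scan number, index, or native ID value
    interpretation: Optional[str]   # optional peptidoform/charge, if present


# Assumption: these are the index types accepted by the USI layout sketched here.
INDEX_TYPES = {"scan", "index", "nativeId"}


def parse_usi(usi: str) -> USI:
    """Split a USI into its components (simplified: no colons in run names)."""
    # maxsplit=5 keeps any colons inside the interpretation intact.
    parts = usi.split(":", 5)
    if len(parts) < 5 or parts[0] != "mzspec":
        raise ValueError(f"not a USI: {usi!r}")
    _, collection, ms_run, index_type, index = parts[:5]
    if index_type not in INDEX_TYPES:
        raise ValueError(f"unknown index type: {index_type!r}")
    interpretation = parts[5] if len(parts) == 6 else None
    return USI(collection, ms_run, index_type, index, interpretation)


if __name__ == "__main__":
    example = ("mzspec:PXD000561:Adult_Frontalcortex_bRP_Elite_85_f09:"
               "scan:17555:VLHPLEGAVVIIFK/2")
    print(parse_usi(example))
```

A real implementation should follow the PSI specification, which covers edge cases this sketch ignores.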

Subjects

Protein Databases; Mass Spectrometry/methods; Proteomics/methods; Computer-Assisted Signal Processing; Software; Algorithms

ThermoRawFileParser: Modular, Scalable, and Cross-Platform RAW File Conversion.

Hulstaert, Niels; Shofstahl, Jim; Sachsenberg, Timo; Walzer, Mathias; Barsnes, Harald; Martens, Lennart; Perez-Riverol, Yasset.

J Proteome Res ; 19(1): 537-542, 2020 01 03.

Article in English | MEDLINE | ID: mdl-31755270

ABSTRACT

The field of computational proteomics is approaching the big data age, driven both by a continuous growth in the number of samples analyzed per experiment as well as by the growing amount of data obtained in each analytical run. In order to process these large amounts of data, it is increasingly necessary to use elastic compute resources such as Linux-based cluster environments and cloud infrastructures. Unfortunately, the vast majority of cross-platform proteomics tools are not able to operate directly on the proprietary formats generated by the diverse mass spectrometers. Here, we present ThermoRawFileParser, an open-source, cross-platform tool that converts Thermo RAW files into open file formats such as MGF and the HUPO-PSI standard file format mzML. To ensure the broadest possible availability and to increase integration capabilities with popular workflow systems such as Galaxy or Nextflow, we have also built Conda package and BioContainers container around ThermoRawFileParser. In addition, we implemented a user-friendly interface (ThermoRawFileParserGUI) for those users not familiar with command-line tools. Finally, we performed a benchmark of ThermoRawFileParser and msconvert to verify that the converted mzML files contain reliable quantitative results.

Subjects

Computational Biology/methods; Proteomics/methods; Software; Protein Databases; Saccharomyces cerevisiae Proteins/metabolism; Workflow

Proteomics Standards Initiative Extended FASTA Format.

Binz, Pierre-Alain; Shofstahl, Jim; Vizcaíno, Juan Antonio; Barsnes, Harald; Chalkley, Robert J; Menschaert, Gerben; Alpi, Emanuele; Clauser, Karl; Eng, Jimmy K; Lane, Lydie; Seymour, Sean L; Sánchez, Luis Francisco Hernández; Mayer, Gerhard; Eisenacher, Martin; Perez-Riverol, Yasset; Kapp, Eugene A; Mendoza, Luis; Baker, Peter R; Collins, Andrew; Van Den Bossche, Tim; Deutsch, Eric W.

J Proteome Res ; 18(6): 2686-2692, 2019 06 07.

Article in English | MEDLINE | ID: mdl-31081335

ABSTRACT

Mass-spectrometry-based proteomics enables the high-throughput identification and quantification of proteins, including sequence variants and post-translational modifications (PTMs) in biological samples. However, most workflows require that such variations be included in the search space used to analyze the data, and doing so remains challenging with most analysis tools. In order to facilitate the search for known sequence variants and PTMs, the Proteomics Standards Initiative (PSI) has designed and implemented the PSI extended FASTA format (PEFF). PEFF is based on the very popular FASTA format but adds a uniform mechanism for encoding substantially more metadata about the sequence collection as well as individual entries, including support for encoding known sequence variants, PTMs, and proteoforms. The format is very nearly backward compatible, and as such, existing FASTA parsers will require little or no changes to be able to read PEFF files as FASTA files, although without supporting any of the extra capabilities of PEFF. PEFF is defined by a full specification document, controlled vocabulary terms, a set of example files, software libraries, and a file validator. Popular software and resources are starting to support PEFF, including the sequence search engine Comet and the knowledge bases neXtProt and UniProtKB. Widespread implementation of PEFF is expected to further enable proteogenomics and top-down proteomics applications by providing a standardized mechanism for encoding protein sequences and their known variations. All the related documentation, including the detailed file format specification and example files, are available at http://www.psidev.info/peff .
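
To make the backward-compatibility claim concrete, here is a minimal sketch of a FASTA-style reader that also tolerates PEFF input. It assumes, as described in the PEFF specification rather than in this abstract, that file-level header lines start with `#` and that per-entry metadata follows the accession as ` \Key=Value` fields. The function name and the sample records are hypothetical.

```python
from typing import Dict, Iterable, Iterator, List, Optional, Tuple

Record = Tuple[str, Dict[str, str], str]  # (accession, metadata, sequence)


def read_peff_as_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (accession, metadata, sequence) from FASTA or PEFF-style lines."""
    accession: Optional[str] = None
    attrs: Dict[str, str] = {}
    seq: List[str] = []
    for raw in lines:
        line = raw.strip()
        # Assumption: '#' lines form the PEFF file header; a plain reader skips them.
        if not line or line.startswith("#"):
            continue
        if line.startswith(">"):
            if accession is not None:
                yield accession, attrs, "".join(seq)
            # Assumption: metadata fields are introduced by ' \' after the accession.
            head, *fields = line[1:].split(" \\")
            accession = head.strip()
            attrs = dict(f.split("=", 1) for f in fields if "=" in f)
            seq = []
        else:
            seq.append(line)
    if accession is not None:
        yield accession, attrs, "".join(seq)


if __name__ == "__main__":
    # Hypothetical sample content, for illustration only.
    sample = [
        "# PEFF 1.0",
        "# //",
        ">db:ACC1 \\PName=Example protein \\Length=5",
        "MKTAY",
        ">db:ACC2",
        "GG",
        "HH",
    ]
    for record in read_peff_as_fasta(sample):
        print(record)
```

The sketch only collects the extra fields as strings; interpreting variants, PTMs, or proteoforms would require the controlled vocabulary and specification referenced in the abstract.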

Subjects

Proteomics/standards; Humans; Information Storage and Retrieval; Mass Spectrometry; Software

Expanding the Use of Spectral Libraries in Proteomics.

Deutsch, Eric W; Perez-Riverol, Yasset; Chalkley, Robert J; Wilhelm, Mathias; Tate, Stephen; Sachsenberg, Timo; Walzer, Mathias; Käll, Lukas; Delanghe, Bernard; Böcker, Sebastian; Schymanski, Emma L; Wilmes, Paul; Dorfer, Viktoria; Kuster, Bernhard; Volders, Pieter-Jan; Jehmlich, Nico; Vissers, Johannes P C; Wolan, Dennis W; Wang, Ana Y; Mendoza, Luis; Shofstahl, Jim; Dowsey, Andrew W; Griss, Johannes; Salek, Reza M; Neumann, Steffen; Binz, Pierre-Alain; Lam, Henry; Vizcaíno, Juan Antonio; Bandeira, Nuno; Röst, Hannes.

J Proteome Res ; 17(12): 4051-4060, 2018 12 07.

Article in English | MEDLINE | ID: mdl-30270626

ABSTRACT

The 2017 Dagstuhl Seminar on Computational Proteomics provided an opportunity for a broad discussion on the current state and future directions of the generation and use of peptide tandem mass spectrometry spectral libraries. Their use in proteomics is growing slowly, but there are multiple challenges in the field that must be addressed to further increase the adoption of spectral libraries and related techniques. The primary bottlenecks are the paucity of high quality and comprehensive libraries and the general difficulty of adopting spectral library searching into existing workflows. There are several existing spectral library formats, but none captures a satisfactory level of metadata; therefore, a logical next improvement is to design a more advanced, Proteomics Standards Initiative-approved spectral library format that can encode all of the desired metadata. The group discussed a series of metadata requirements organized into three designations of completeness or quality, tentatively dubbed bronze, silver, and gold. The metadata can be organized at four different levels of granularity: at the collection (library) level, at the individual entry (peptide ion) level, at the peak (fragment ion) level, and at the peak annotation level. Strategies for encoding mass modifications in a consistent manner and the requirement for encoding high-quality and commonly seen but as-yet-unidentified spectra were discussed. The group also discussed related topics, including strategies for comparing two spectra, techniques for generating representative spectra for a library, approaches for selection of optimal signature ions for targeted workflows, and issues surrounding the merging of two or more libraries into one. We present here a review of this field and the challenges that the community must address in order to accelerate the adoption of spectral libraries in routine analysis of proteomics datasets.

Subjects

Protein Databases/standards; Peptide Library; Proteomics/methods; Animals; Humans; Tandem Mass Spectrometry/methods; Workflow

TraML--a standard format for exchange of selected reaction monitoring transition lists.

Deutsch, Eric W; Chambers, Matthew; Neumann, Steffen; Levander, Fredrik; Binz, Pierre-Alain; Shofstahl, Jim; Campbell, David S; Mendoza, Luis; Ovelleiro, David; Helsens, Kenny; Martens, Lennart; Aebersold, Ruedi; Moritz, Robert L; Brusniak, Mi-Youn.

Mol Cell Proteomics ; 11(4): R111.015040, 2012 Apr.

Article in English | MEDLINE | ID: mdl-22159873

ABSTRACT

Targeted proteomics via selected reaction monitoring is a powerful mass spectrometric technique affording higher dynamic range, increased specificity and lower limits of detection than other shotgun mass spectrometry methods when applied to proteome analyses. However, it involves selective measurement of predetermined analytes, which requires more preparation in the form of selecting appropriate signatures for the proteins and peptides that are to be targeted. There is a growing number of software programs and resources for selecting optimal transitions and the instrument settings used for the detection and quantification of the targeted peptides, but the exchange of this information is hindered by a lack of a standard format. We have developed a new standardized format, called TraML, for encoding transition lists and associated metadata. In addition to introducing the TraML format, we demonstrate several implementations across the community, and provide semantic validators, extensive documentation, and multiple example instances to demonstrate correctly written documents. Widespread use of TraML will facilitate the exchange of transitions, reduce time spent handling incompatible list formats, increase the reusability of previously optimized transitions, and thus accelerate the widespread adoption of targeted proteomics via selected reaction monitoring.

Subjects

Information Systems; Proteomics; Software

mzML--a community standard for mass spectrometry data.

Martens, Lennart; Chambers, Matthew; Sturm, Marc; Kessner, Darren; Levander, Fredrik; Shofstahl, Jim; Tang, Wilfred H; Römpp, Andreas; Neumann, Steffen; Pizarro, Angel D; Montecchi-Palazzi, Luisa; Tasman, Natalie; Coleman, Mike; Reisinger, Florian; Souda, Puneet; Hermjakob, Henning; Binz, Pierre-Alain; Deutsch, Eric W.

Mol Cell Proteomics ; 10(1): R110.000133, 2011 Jan.

Article in English | MEDLINE | ID: mdl-20716697

ABSTRACT

Mass spectrometry is a fundamental tool for discovery and analysis in the life sciences. With the rapid advances in mass spectrometry technology and methods, it has become imperative to provide a standard output format for mass spectrometry data that will facilitate data sharing and analysis. Initially, the efforts to develop a standard format for mass spectrometry data resulted in multiple formats, each designed with a different underlying philosophy. To resolve the issues associated with having multiple formats, vendors, researchers, and software developers convened under the banner of the HUPO PSI to develop a single standard. The new data format incorporated many of the desirable technical attributes from the previous data formats, while adding a number of improvements, including features such as a controlled vocabulary with validation tools to ensure consistent usage of the format, improved support for selected reaction monitoring data, and immediately available implementations to facilitate rapid adoption by the community. The resulting standard data format, mzML, is a well tested open-source format for mass spectrometer output files that can be readily utilized by the community and easily adapted for incremental advances in mass spectrometry technology.

Subjects

Protein Databases/standards; Mass Spectrometry/methods; Mass Spectrometry/standards; Software/standards; Reference Standards; Reproducibility of Results

The PSI-MOD community standard for representation of protein modification data.

Montecchi-Palazzi, Luisa; Beavis, Ron; Binz, Pierre-Alain; Chalkley, Robert J; Cottrell, John; Creasy, David; Shofstahl, Jim; Seymour, Sean L; Garavelli, John S.

Nat Biotechnol ; 26(8): 864-6, 2008 Aug.

Article in English | MEDLINE | ID: mdl-18688235

Subjects

Protein Databases/standards; Informatics/standards; Post-Translational Protein Processing; Proteomics/standards; Computational Biology/methods; Computational Biology/standards; Mass Spectrometry/methods; Mass Spectrometry/standards; Proteomics/methods
